Emanuele Iaccarino personal mark Emanuele Iaccarino personal mark E. Iaccarino
  • home
  • about
  • academic
  • projects
  • publications
  • awards & certifications
  • CV
  • get in touch!
    • LinkedIn
    • email

Churn Analysis on Banking Dataset

Machine Learning
Churn Prediction
Banking
Research

Predicting customer churn in the banking sector using boosting algorithms, Optuna optimization, and interpretable ML techniques.

Published

May 1, 2024

This project develops a customer churn prediction model for the banking sector, aiming to identify the top 10,000 clients most likely to close their accounts.

Methods
- Preprocessing: data cleaning, categorical encoding
- Sampling: cost-sensitive techniques to address class imbalance
- Boosting algorithms: LightGBM, XGBoost, CatBoost, with ensemble weighting
- Hyperparameter optimization: automated tuning with Optuna
- Interpretability: SHAP values for global and local feature importance

Key results
- Ensemble boosting models achieved the best performance using a custom Rank Probabilities metric, prioritizing recall of churners
- Synthetic data improved testing robustness on unseen distributions
- SHAP analysis revealed key socio-demographic and account features driving churn risk

View project on GitHub


Visualization of churn prediction and feature importance in banking dataset

Boosting models with interpretability for churn prediction

© 2025 Emanuele Iaccarino

 

This website is built with , , and Quarto